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We analyze the effect of interference on the convergence rate of average consensus algorithms, 
which iteratively compute the measurement average by message passing among nodes. It is usually 
assumed that these algorithms converge faster with a greater exchange of information (i.e., by in- 
creased network connectivity) in every iteration. However, when interference is taken into account, 
it is no longer clear if the rate of convergence increases with network connectivity. We study this 
problem for randomly-placed consensus-seeking nodes connected through an interference-limited 
network. We investigate the following questions: (a) How does the rate of convergence vary with 
increasing communication range of each node? and (b) How does this result change when each 
node is allowed to communicate with a few selected far-off nodes? When nodes schedule their 
transmissions to avoid interference, we show that the convergence speed scales with r 2 ~ d , where 
r is the communication range and d is the number of dimensions. This scaling is the result of two 
competing effects when increasing r: Increased schedule length for interference-free transmission 
vs. the speed gain due to improved connectivity. Hence, although one-dimensional networks can 
converge faster from a greater communication range despite increased interference, the two effects 
exactly offset one another in two-dimensions. In higher dimensions, increasing the communication 
range can actually degrade the rate of convergence. Our results thus underline the importance of 
factoring in the effect of interference in the design of distributed estimation algorithms. 
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I. Introduction 



A. Motivation 

The advent of wireless sensor and ad hoc networks has motivated the need for distributed 
information processing algorithms, which allow each node to operate only on local information. 
A well-studied algorithm that allows distributed averaging is the average consensus algorithm, 
wherein the global average of a set of initial sensor observations can be computed based on 
purely local computations at each sensor. Starting from a set of initial measurements, the average 
consensus algorithm allows a set of nodes to communicate by a (possibly time-varying) topology to 
iteratively compute the global average of the initial measurements, see e.g., H]]-0 and references 
therein. The connectivity properties of the topologies that ensure convergence have been well- 
studied (e.g., iflOll . ifm *). Of late, the focus has shifted to studying convergence in the face of 
communication constraints, like quantization lfT2l -lfT411. packet drops |[T5l and noise 1 16]. A closely 
associated algorithm is the gossip algorithm (2), ifTTl , ||T~8ll . In particular, the recent work |fl"8l 
proposes and studies a probabilistic version of the broadcast gossip algorithm [17]. The idea is to 
exploit channel fluctuations to enable opportunistic longer-range message-passing. Since only one 
node is allowed to transmit at any given time, the question of interference does not arise. 

In this paper, unlike prior work, we study the effect of interference, which becomes important 
in the formation of more general message -passing topologies. We explicitly model the effect of 
interference on the rate of topology formation — and hence convergence — of the average consensus 
algorithm. This important effect — which crucially depends on network geometry — has been largely 
ignored. In wireless networks, depending on the physical proximity of a to d and c to b, the 
transmission from a to b and c to d may interfere with one another; hence two time slots may be 



needed to establish edges (a, b) and (c, d). The network thus has two time-scales of interest: that of 
establishing individual communications among the desired set of nodes and that of the iterations 
of the distributed algorithms, which occur only when all the desired nodes have successfully 
communicated. One may thus, view the underlying communication network as constructing the 
desired message passing graphs from several feasible sub-graphs, each of which satisfies half- 
duplex, fading and interference constraints. The union of all these sub-graphs is the desired message 
passing graph. 

To illustrate this, consider the formation of a simple linear 6-node network shown in Fig. [T] 
Suppose the estimation algorithm requires nearest-neighbor communication (shown as bidirectional 
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edges). However, due to interference constraints, only every third node can transmit. In this case, 
we see that forming the the desired topology requires at least three time-slots, as shown. In other 
words, for these interference constraints, this topology's fastest rate of formation is three time 
slots. Clearly, a topology's intrinsic benefit and the fastest rate of its formation determine its true 
utility. 



Desired 

Message-Passing 0<-^>0<-^0<-^>0<-^>0<r->0 

Graph 



Time Slots 




Feasible 
Communication 
Graphs 



Figure 1. An example illustrating the constraints introduced by interference. If the nodes are physically placed as shown, 
interference limits the number of nodes that can communicate concurrently. Assuming a spatial re-use factor of two, 
the message-passing graph can be formed as a union of three feasible sub-graphs, each of them satisfying interfefence 
constraints. We consider this TDMA schedule feasible. 



The performance of the underlying (real-time) estimation algorithm is therefore coupled with 
algorithms for channel access and routing. In our previous work |fT9l , we studied the coupling with 
channel access for the average consensus algorithm for a certain class of deterministic network 
topologies. Using a simple protocol model |[20l for reception, we were able to show that the effect 
of increasing network connectivity depends crucially on its dimension. In our recent work ||2T1 we 
exploited the well-known parallels between the convergence of the average consensus algorithm and 
Markov chain mixing (e.g., and the references therein) to study consensus on disk graphs ll22l 
using the more refined physical model. We examined the scaling behavior of the fastest rate 
of topology formation with interference, captured by the shortest feasible TDMA schedules that 
construct the graph. 

We note here that implementing inter-node communication in a network will require some addi- 
tional overhead. For example, one possible protocol that establishes point-to-point communication 
can have nodes tag their packets with their uniquely assigned address. A receiver reads this address 
and decodes a packet only if the address is that of one of its intended transmitters. In this work, 
we neglect this additional overhead. However, we show that even when this overhead is neglected, 
increased interference alone is enough to significantly lower the rate of topology formation. 
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B. Main Contributions 

In this paper, we study networks with short-range and networks with both short-range and 
limited long-range communication. Although remarkable improvements in convergence rate have 
been reported |[23l - |[25l for consensus on graphs with a few long-range edges (as in small-world 
graphs 1126*1 "). it is not clear if these benefits will carry over to a wireless setting, where long- 
range links come at a cost of increased interference. Motivated by this fact, we study the average 
consensus problem in graphs formed by overlaying long-range edges onto an existing "short-range" 
disk graph. We derive the scaling law for the spectral gap as well as that of the fastest rate of 
topology formation in the presence of interference. To the best of our knowledge, this is the first 
such attempt. 

We find that the spectral gap scales quadratically in the communication range r, independently 
of the network dimension d, but the length of the shortest TDMA schedule that constructs such 
graphs scales as r d . Thus when interference is factored in, the benefit of a greater communication 
range depends crucially on the network dimension: 

• For one-dimensional networks (d = 1), topologies with increased communication range can 
converge faster despite greater interference. 

• For two-dimensional networks, the rate of convergence scales independently of the commu- 
nication range. 

• For three- (and higher-) dimensional networks, increasing the communication range can ac- 
tually slow down convergence. 

Furthermore, these results hold whether each node only communicates with all other nodes within 
its communication range, or, additionally, with a small number of far-away nodes. Thus our results 
significantly change many optimistic results obtained by analyzing the consensus problem in an 
abstract graph-theoretic setting. 

The remainder of this paper is organized as follows. In Section UH we provide some standard 
definitions and results used in this paper. In Section |IIIJ we specify our system model and formulate 
the problem using the terminology developed in Section ITT] In Section [TVl we discuss convergence 
results for the disk graph model. In Section [V] we study the effect of selective long-range com- 
munication and provide the relevant scaling results. Section [VI] concludes the paper. 
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II. Definitions and Notation 

To make this paper self-contained, we formally state the following standard definitions and facts 
about Markov chains and introduce some notation and other relevant terminology. 

1 ) Basic Definitions from Markov Chain Theory: Consider a connected undirected graph G, 
with n vertices V = {1, 2, . . . ,n} and a set of edges E. We assume G also contains all self-loops, 
i.e., i G V G E. Let di denote the degree of vertex i. For more information, see IT271 . 

Definition 1. (Random walk on a graph) A random walk X(G) = {X^)^^ on V is characterized 
by the n x n transition probability matrix P(G) = [pij], with p^j = P(Xk+\ = i | X^ = j), and 
Pij > only if E E, with = 1 Vt G V. 

Observe that P is stochastic. 

Definition 2. (Symmetric random walk) A random walk is symmetric if = pji. 

For a symmetric random walk P is doubly stochastic. 

Fact 3. A random walk on G is a Markov chain with state space V. Given an initial distribution 
7r(0) over V, the distribution 7v(k+l) after k+1 steps satisfies n(k+l) = P7v(k)for k = 0, 1, . . . 

Definition 4. (Stationary distribution of a Markov chain) A stationary distribution n* satisfies 
7r* = P-7T*, i.e., remains invariant with time. 

Definition 5. (Reversible Markov chain) A Markov chain X = (Xk)k&z is said to be reversible if 
for all states i, itjPij = n*Pji- 

Fact 6. An irreducible and aperiodic Markov chain has a unique stationary distribution. 
Definition 7. (Natural random walk) A natural random walk on G is a random walk with 



Pij 



l/2di, (i,j)£E,i^j 
1/2 i = j. 



Fact 8. The natural random walk is reversible, irreducible and aperiodic with a unique stationary 
distribution ir* = • When G is regular, a natural random walk is also symmetric and has a 
uniform stationary distribution. 
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Definition 9. (Mixing time of a random walk) For a random walk X with a unique stationary 
distribution it*, consider the Total Variational (TV) distance^ (cf. E71 Chap. 4]) c?tv ,i(t; tto) — 
\ Si I^P^t = *)7To) ~~ vr* | for an initial distribution ttq. Then the mixing time of X is defined as 

T mix (e;P) 4 SU pmf{i : d T v(*; P, tt(0)) < e}. 

tt(O) 

2) Asymptotic Notation: We use the following asymptotic notation. For two functions / and g 
of a variable n, as n — > oo, we write 

• g = 0(f) if the ratio #// is asymptotically finite. Further, g = o(f) if this limit is zero. 
. g = (1(f) if / = 0(g). Further, g = u(f) = o(<?). 

. 5 = 8(/) if 5 = and 5 = n(/). 
When / and g are random, these relations are defined to hold with probability one. 

3) Graph Sequences and the Asymptotic Regime: Consider a sequence of (possibly random) 
undirected graphs (G n ), whose n th member G n has n vertices V n = {1,2, ... ,n} and a set of 
edges E n . We assume each graph contains all self-loops. Denote the maximum and minimum node 
degrees of G n by d max (G n ) (shortened to d max ) and d m j n (shortened to d m i n ) respectively. We 
provide some standard definitions below. 

Definition 10. (Asymptotically regular graph) G n is asymptotically regular if d mSLX (G n )— <i mm (G n ) = 
o(l). 

Definition 11. (Asymptotically almost sure validity) A property V is true asymptotically almost 
surely (a.a.s.) for a sequence of random objects (X n ), if lim n ^oo P (X n has property V) = 1. 

We obtain scaling results for the convergence of the average consensus algorithm in large 
networks by mapping the problem to the scaling of mixing times of natural random walks on 
a sequence of graphs that are connected and regular asymptotically almost surely. 

III. Problem Formulation 

A. Average Consensus and Random Walks 

Consider a set of sensor nodes V n = {1,2, ... ,n}. Associate with the i th sensor an initial 
observation Zi(Q). Given a realization of a random message-passing graph G n with vertices V n 

'The TV distance between two distributions fi and v over a countable set S is defined as v^tV — \ 23jp<s \t L i~ v i\ 
(essentially the i\ norm). 
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and edges E n , suppose that all the vertices i S K synchronously update their observations as 

Zi(k + l) = ~Zi(k) + ±- £ (zj(k) - Zi(k)), A; = 0,1,... (1) 

1 j6JV 4 (G n ) 

Here Ni{G n ) denotes the neighborhood of vertex i in G n . By stacking the individual observations 
Zi to form the observation vector z, the (A; + l) th update starting from an initial observation vector 
z(0) can be written as 

z(k + 1) = W n z(k). (2) 

where we have defined the update matrix W n = (I n — A n L n )/2, where l n denotes the n x n 
identity matrix, A n = diag [ci^ -1 ] and L n is the graph Laplacian. Notice that W n depends on the 
realization of the random graph G n , which remains the same for all iterations. We will analyze 
the speed of convergence for specific families of random graphs in the scaling limit n — > oo, by 
deriving properties of interest that hold a.a.s. for all realizations of G n . 

Without loss of generality, let £j(0) > 0, and define ^(0) = £i(0)/J^i ^i(O) as the normalized 
initial observation vector. In the light of Fact [3] and Definition |7J the iteration z'(k + 1) = W n z'(k) 
can now be interpreted as time-evolution of the node occupancy distribution of a natural random 
walk over G n with a transition probability matrix W n (2), ll23l . 

If G n is also connected, this equivalence with a natural random walk ensures (from Fact [6]) that 
the value of each vertex asymptotically reaches ~ ^ Zj(0) = 1 T ^- (a more general result for a 
time-varying case was studied in ifTOlO . Interpreting each vertex as a sensor and the initial values 
{zi(0))i£v n as sensor measurements, this algorithm allows each sensor to iteratively compute the 
average i J2i z «(0) of the initial measurement set by exchanging messages as described in CD). We 
will sometimes also refer to G n as the message-passing network. 

The rate of convergence of (O to its steady state value can be understood in terms of the mixing 
time of the natural random walk described by W n . Indeed, by expressing z[ in terms of Zj, we 
can write from Definition |9j 

Tmix(e; W n ) = supinf{fc : \\z(k) - n _1 lz ||TV < e^o} (3) 

z(0) 

where z = 

When G n is a.a.s. connected and regular, we know from Fact [8] that the stationary distribution 
of the random walk is uniform a.a.s., thereby implying convergence to average consensus a.a.s. 
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In this paper, we analyze random graphs based on the disk graph E2I . which are parameterized 
by the disk radius (see Section ITlI-B|) . For this family of graphs, it is well-known that the graphs are 
a.a.s. connected if and only if the radius remains large enough with n (i.e., in the "supercritical" 
regime |[28l . see, e.g., [29] for a proof). In this regime, the asymptotic regularity property was 
formally shown to hold a.a.s. in (2j Lemma 10]. In fact, in O these two properties were used 
to establish scaling laws for the mixing time of both the natural and the fastest mixing reversible 
random walks on these graphs to the uniform distribution. 

It is well-known that the mixing time of a random walk can be characterized by the second- 
largest eigenvalue of W n . Denoting the eigenvalues of W n by p,\ = 1 > > ■ ■ ■ > /J- n > 0> 
the asymptotic convergence of the iteration © is determined by p,2- The result below formally 
establishes this dependence: 

Theorem 12. [30]. The e— mixing time of a random walk with a doubly stochastic positive definite 
transition matrix W n on a connected graph G n is bounded as 

^!? S(2e) : 1 <T m Ue;W n )< l °^- lOSe , 
2(1 - fj, 2 ) 1- 112 

where 1 — ^2 is called the spectral gap of G n . 

Remark: Observe that the spectral gap controls the mixing time. In the scaling limit n — > oo, the 
scaling of e also becomes important. The logarithmic dependence on e _1 suggests three meaningful 
possibilities: 

1) Polynomial scaling: e = l/n s for some fixed S > 0. 

2) Exponential scaling: e = exp(— 5'n) for some fixed 5' > 0. 

3) Constant error: e <^ 1 is constant. 

For polynomial and exponential error scaling, it is clear that the bounds in Theorem [12] are of 
the same order, and are 0((1 — l^2)~ 1 logn) and 0((1 — //2) -1 ^) respectively. For constant error, 
the upper bound scales logn times faster than the lower bound, i.e., T m i x = f2((l — /X2) _1 ) and 
r = 0((l-^ 2 )- 1 logn). 

In the sequel we assume polynomial scaling, as was done in Q. It will become clear in the 
later sections that the scaling laws for exponential scaling follow from a substitution log ih->r. 

Spectral Gap and Cheeger's Inequality: 

Intuition suggests that the mixing time of a Markov chain depends on how "easy" it is to move 
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out of any specified region in the state space. This property can be formalized with the notion of 
conductance. The conductance of a reversible Markov chain on a state space Q, = V on a graph 
G n with an equilibrium distribution n* is defined as follows (3TJ: 

h = mm 77^, (4) 

Scn,-n-(S)<i/2 vr*(6) 

where tt*(S) 4 £ ieS 7T*(*) and S = fi\5, and Q(S,S) = E, 68 , ie ^*(^ft+i = j|^n = i). 
Viewed in graph-theoretic terms, the numerator dl} measures the effective weighted flow across 
the cut (S, S), while the denominator measures the weighted capacity of S. Intuitively, we would 
expect a larger conductance to correspond to a smaller mixing time, or equivalently from Theorem 
[T2l a larger 1 — fi2 of the underlying graph G n . This is indeed the case, as Cheeger's Inequality 
shows: 

Theorem 13. / |30[/ . The spectral gap of a reversible Markov chain satisfies 

h? 

— < 1 - [12 < 2h, 
where h is the conductance of the Markov chain. 

Once we know how h scales with n for a (random) sequence of graphs (G n ), we can use 
Theorem [13] to find the scaling law for their spectral gap. This, in turn, permits the use Theorem 
[12] in deriving scaling laws for the mixing time for iterations of the form ([2]) on these sequences of 
graphs. In the following, motivated by the need to capture the distance-dependence and randomness 
in the connectivity of the nodes, we present random geometric graph models for G n . 

B. Network Models 

Each point i G {1,2,... , n} is placed uniformly randomly in a d— dimensional torus Td on 
[0, l] d , i.e., the vertices form a binomial point process |[32l $ = {xi}, i = 1,2, ... n, on Td- Each 
element of (G n ) is based on the well-known disk graph model [22], ll28l . In the following let 
bd(x,r) = b(x,r) denote a Euclidean ball centered at x G M. d and radius r, and |6(a;,r)| denote 
its volume. 

1) Networks with Short-Range Communication: In this case, G n is the d— dimensional disk 
graph parameterized by the common communication range r of each node. The neighborhood of 
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node Xi € $ that will be used for implementing (Q]) is 

iV Xi (r) = { Xj <E $ : - Xf[| < r}, 

where || • || denotes the Euclidean norm. In this paper, we will always operate in the super-critical 
regime, i.e., r = uj(r c ), where r c = (^p) 1 /^ to ensure asymptotic connectivity and regularity of 
(G n ) [20]. We label this family of graphs as G^(r,d) = G^ h (r), and the update matrix by W^ h . 
We refer to the points of $ either by their location Xi G M. d or by their index i € N. 

2) Networks with both Short- and Selective Long-Range Communication: We start with a disk 
graph G^ h (r) and add long-range edges of length s = ©(r 7 ). The parameter 7 controls the distance 
over which long-range communication occurs: for a given r a node can communicate with nodes 
farther away as 7 — > 0. We add the long edges as follows. 

For some r, 77 > and < 7 < 1, tile the torus with hypercubes of side length rjr. Let c 
denote one of these hypercubes. Along each dimension m = 1, 2, . . . d, let c+ and c~ denote the 
farthest hypercubes from c that are less than distance s/2 away from c along the m th coordinate 
axis, the distance being measured in terms of the separation between their farthest edges. We 
call these hypercubes as the partner hypercubes of c. Figure [2] illustrates the case of d = 2. It 
is easy to see that from any vertex in c, any vertex in c+ and is at a distance of at most 
\f(d — l)rj 2 r 2 + s 2 /4 < for a small enough rj. 

Since r = w(r c ), every tile c contains nrfr 2 nodes a.a.s. Without loss of generality, let x\ be one 
of these nodes. Now add an edge between xi and every vertex in c+ , c~ for m = 1, 2, . . . , d. Thus 
each of these nodes becomes a long-range partner of x\. Repeat this procedure for every node in 
<]?, and count duplicate edges only once. Thus for r = u(r c ), every node in every tile is additionally 
connected to nr 2 \b(0, 1)| + 2dm] 2 r 2 + o(l) nodes a.a.s., i.e., G n is regular asymptotically almost 
surely. Hence an iteration of the form (O on this graph will converge to a uniform distribution 
a.a.s. We define the resultant graph as G l n (r, s, d) = G l n (r, s) and the corresponding update matrix 
by W'. 

Notice that this model adds long edges selectively to each node; it is motivated by the observation 
that a small number of long edges added to a graph can greatly increase its spectral gap, as is the 
case in small-world graphs (cf. |[33l Chap. 14]). We have adapted this idea to a wireless network. 
Instead of adding a single additional edge to a node as is normally the case in abstract graph- 
theoretic models, the inherent broadcast nature of the wireless channel allows a transmitter to 
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Figure 2. An illustration of the geometric random graph models for d = 2. The vertices are shown as black circles. 
In Gn(r), an edge exists between any two nodes iff they are at most at a distance r (the communication range) away 
from each other. This is shown, for example, for the node at the center of the circle. G l n contains all edges in G^. 
Additionally each node communicates with its long-range partners. For example, for each node in the dark gray square, 
all nodes in the lightly shaded squares are long-range partners. These partner squares are chosen such that the distance 
between their farthest edges is less than s/2. Note that there are 4 such partner squares, two along each coordinate axis. 



broadcast its information to several receivers that are in close proximity to one another with very 
little overhead. This allows multiple communication paths to form simultaneously. 

We now describe the communication model, which is a well-accepted model in the study of 
wireless networks. 

C. Communication Model 

We make the following assumptions on the communication model: 

• All edges in G^ 1 and G l n are established by wireless links that operate in the same frequency 
band (normalized to unit bandwidth). 

• Each node encodes its message in K > 1 nats, such that there is negligible quantization error. 
These messages are sent using a point-to-point capacity-achieving AWGN channel code with 
SNR threshold (3 (i.e., R = log(l + /?)). Transmissions are slotted with K/R channel uses 
allowed per slot. 

• There is no fading. The path-loss exponent a is greater than the dimension d of the network, 
so that the interference remains finite a.s. as the network size grows. 

• A packet from node i can be received at j iff the Signal-to-Interference-Ratio (SIR) at node 
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j, SIRjj, is greater than a known constant /3 > 0. Therefore for any sender i and receiver j, 
the link i — > j will be in outage iff 

_ 11 J „ ^ < /3. (5) 

l^kes\{i] W x j x k\\ 

where S is the set of all senders that transmit in the same slot as i. This is the well-known 

interference-limited physical model EUlr . 
• The medium-access scheme is TDMA with spatial re-use. 
Thus the successful formation of each edge in a graph G n is mapped to a successful link formation 
in each direction. Notice that (f5]) models that fact that there is a limit to the number of edges that 
can be formed simultaneously, and consequently on the maximum rate at which a given message- 
passing graph can be established. For a given TDMA protocol, the rate of topology formation is 
thus determined by its schedule length in time-slots. Since we investigate networks in the scaling 
limit, we will investigate the scaling properties of the fastest TDMA protocols that can establish 
a given sequence of random graphs (G n ) (i.e., have the smallest schedule length a.a.s.) 

D. Quantifying the Effective Speed of Convergence 

Note that the mixing time, which is a function of the update matrix W n , the smallest number 
of iterations to converge to an e— ball around the average consensus point. This is different from 
the time taken to taken to converge to this ball with a finite rate of topology formation in each 
iteration. For example, in Fig. \T\ due to interference constraints, the shortest schedule to construct 
this topology has at least three time slots. Hence message-passing iterations using this topology 
can occur no faster than once in every three time slots. 

Thus for a topology G n and an update matrix W n , the smallest effective time to converge is the 
product of the mixing time T m i x (e;W n ) of a topology and the length T*(G n ,f3) of the shortest 
TDMA schedule that constructs the topology in each iteration. We call this the Slot Mixing Time. 
We formally state it below for future reference: 

Definition 14. (Slot Mixing Time) The Slot Mixing Time T slots (G n ) = T slots (G n , W n , /3, e) is 

2 It is possible to derive our scaling results by including both noise and interference in the SINR model at the cost 
of making equations and derivations more cumbersome while distracting from the main message of the paper, which is 
the analysis of the performance with interference. Hence we focus on the interference-limited case. 
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defined as the product 

T s i ots (G n ) ± T mix (e; W n ) • T*(G n ,(3), 

where T m i x (e;W„) is the e— mixing time of iterations using a message-passing graph G n and an 
update matrix W„ and T*(G n ,j3) is the length of the shortest TDM A schedule that constructs G n 
in time slots. 

Notice that in general T s i ots (G n ) depends on the realization of the random graph G n . We will 
analyze the scaling of T s \ t s {G n ) for the families of random geometric graphs described in Section 

EES 

E. Asymptotic Behavior 

From Sections IlII-AI and IIII-CI we notice that the problem involves: 

• The network size n. 

• The short link distance r. 

• The parameter 7 that controls the length of long links. 

We will study the mixing time in an interference-limited network in the regime n — > 00. 

IV. Convergence in Networks with Small Communication Range 

A. Characterizing the Spectral Gap 

The spectral gap for the disk graph is known to be B(r 2 ), independent of network dimension 
Using Cheeger's Inequality (Theorem [121). it was shown that the mixing time of the fastest mixing 
reversible random walk with a uniform distribution on G^ h (r), for polynomial scaling e = l/n s , 
5 > scales as 

T mix (Wf) = G(r- 2 logn). (6) 

It was also shown therein that the mixing time for the natural random walk on G^ 1 is also 
6(r -2 logn). We will now use combine the scaling law for the mixing time with the fastest rate 
of topology formation implied by the communication model in Section IIII-CI 

B. Interference-Limited Topology Formation 

We now prove two results that follow from the assumptions made in Section IIII-CI 
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Proposition 15. Consider a system of n nodes on a d— dimensional torus with a short-range 
communication range r, that communicate using point-to-point codes with SINR threshold f3, with 
a > d being the path-loss exponent. Assuming the short-range network model in Section \III-B1 1 
and the communication model described in \III-C\ the length of the shortest TDMA schedule that 
constructs G^ h has no fewer than C\nr d {5 d l a slots a.a.s., for some positive constant C\. 

Proof: Let S be the set of concurrent transmitters at any given time. Suppose node j is an 
intended receiver of a transmitter i € S. Then z's message is decoded correctly iff ((5]) is satisfied. 
Thus for all k £ S\{i}, 

\\xj-x k \\ > f3 1/a \\xj - Xi\\. (7) 

Clearly this is true even for the farthest intended receiver. It is easy to show that such a receiver 
lies a.a.s. in a ring of inner radius s(l — 5 ) for some fixed 5" > 0. We thus conclude \\xf, — Xj\\ > 
r(l — 5")j3 l / a = r m ; n a.a.s. 

This suggests that any TDMA protocol allowing i to pass a message to its farthest node j needs 
to set up a guard zone of radius no smaller than r m j n around j. Since every node inside this 
guard zone must transmit at least once to form the required message passing graph, any TDMA 
protocol that constructs the message passing graph requires least X^e* ^-xe^nb(0,r miu ) slots. 
Here the indicator la; e <i>nfe(o,r m i„) i s use( ^ to indicate the existence of the point x G $ inside the 
ball 6(0,r m i n ). The summation is over all points x G $. 

For r = u(r c ), each such ball has n\b(0, r min )| = nr d p d l a {l - <5) d |fr(0, 1)| +o(l) > C x nr d ^ d l OL 
a.a.s., where G x = 0.5(1 - 5) d \b{0, 1)|. ■ 

Proposition 16. Consider the network model in Section \III-B1\ and the communication model 
described in \III-C\ The length of the shortest TDMA schedule that constructs G^ h has at most 
C2nr d f3 d / a slots a.a.s., for some positive constant Ci- 

Proof: The proof involves construction of a feasible TDMA schedule whose length is C2nr d f3 d l' 
Let x = Or for some fixed 6 > 1. Consider the lattice L that consists of points on the scaled integer 
lattice xl? that also lie on the torus. In other words, L = xZ 2 n72(n). Partition L into sublattices 
as follows: 

• Lqo — {(ix,jx) G L : i and j are even} 
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• Loi = {(ix,jx) G L : ieven, jodd} 

• Lio — {(ix,jx) G L : iodd, jeven} 

• Ln = {(ix,jx) G L : i and j are odd} 

With each lattice site p G L one can associate the tile t p = p + [0, x] 2 that lies within the torus 
7~2{n). Denote by T^ the set of such tiles associated with each of the points in Ly, i, j = 0, 1. 
For example, Too — { T p '■ P G Loo}- Thus {Ty} partition the torus 7i(ra). 

The idea behind such a partition is to enable spatial re-use. Consider the following four-phase 
MAC protocol consisting of phases 00, 01, 10, 11. In phase ij at most one node from each tile in 
is allowed to transmit. The protocol ensures that each node transmits exactly once. 

The next step is to show that this protocol provides the desired connectivity to each node every 
C2nr 2 f3 2 / a time slots for some positive Ci- To this end, we first show that the interference at each 
intended receiver is bounded from above and can be made smaller than any (3 > by a suitable 
choice of 9. 

Consider one such transmission in phase 00. Let S C Too H V n be the set of all transmitters. 
Consider a transmitting node i in tile r p where p = (0,0), i.e., a tile at the origin. To remain 
feasible, the protocol must satisfy (f5]) for each successful link. For any i,j, k, it is clear that 

\\%k — \\%k *^vll 

^ \\^k W^j *^ll 

— \\%k -Ei || f> 

since \\xj — Xi\\ < r. Therefore for a transmitter at Xi, the interference power at any intended 
receiver at Xj can be upper bounded as 

J2 l|a*-Zjir Q < Yl {\\*k - Xi\\ - r)~ a , (8) 

kdS\{i} k£S\{i} 

where the right hand side is independent of j. By the design of the protocol, an interferer k for 
any intended receiver of the message from i must lie in a tile distinct from 77 0i o)- Moreover, such 
a tile should lie within Too; thus the protocol imposes a lower bound on the minimum distance 
between any two concurrent transmitters. Using geometrical arguments (see Figure [3]), the right 
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Figure 3. Geometric reasoning underlying the proof of Proposition \TE\ The location of a typical transmit- 
ter in Tofi and one of its intended receivers is shown. The nearest interferers and their respective tiles are 
T2,o, T2,2) To,2, T-2,2, T-2,0, T-2,-2, To, -2, T2,-2. The signal power from any of one these interferers at the intended 
receiver is no larger than that received from the closest interferer allowed by the protocol. The protocol ensures that this 
nearest distance is no smaller than x = dr. 



hand side of ([8]) is upper bounded as 



(\\ x k-Xi\\ ~r) a 

k<=S\{i} 



it — r) 



< J^8Z((2Z - 1)07 
1=1 

oo 

= 8r- Q ^Z((2Z-l)fl-ir a 
1=1 

(oo 
((9 - l)- a + ^ l((2l - 1)9 - 9)-° 
1=2 

(oo > 
{9 - l)- a + 2~ a 9~ a J2 K l ~ l)~ a 

<£r- a {9-iy 



1=2 



(9) 



for some fixed £ > 0, since the sum converges for a > 2 (in general, for a > d, as assumed in 
the communication model). The SIR condition © is guaranteed to be satisfied at every intended 
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j, if 9 is chosen such that 

r ~ot 

ir~ a (e - iy a - P 

For a suitable choice of £, we can set = 10(£/3)°. 

For s = oj(s c ), the number of nodes in each tile is nx 2 + o(l) a.a.s. Hence as n — > oo, the 
protocol constructed requires 4nx 2 + o(l) < C^ns 2 f$ 2 l OL transmissions almost surely to establish 
the necessary connectivity to each node in the network, where C2 > 400£ 2 / a . By optimality, the 
number of slots T* in the shortest TDMA schedule cannot exceed this number. ■ 

The results from Propositions [15] and [16] lead to the following corollary. 

Corollary 17. IfT*{G^{r), f$) denotes the length of the shortest TDMA schedule, then as n — > 00, 
a.a.s.: 

1) For fixed j3, T*(G° h ,/3) = Q(nr d ). 

2) When /3 = (3(n) = T*{G^,p) = n(nr d {(3(n)}^). 

Proof: Claim 1 is evident from the results of Propositions \T5\ and [16] 
For some constants C\ and C2, we have from Propositions [TBI and [T6] a.a.s. for large n and a 
fixed P, 

dnr d ^ a < T*{G* h ,/3) < C 2 nr d ^ a . 

Since C\ (but not Ci) is independent of /3, we can write for n — > 00, when /3 = f3(n) = Sl(l) 
T*(G* h ) = n(nr d ^(n)). ■ 

If all nodes had independent point-to-point channels between one another, the rate of topology 
formation would be 0(1). For a wireless channel, however, Corollary [T7] suggests that it requires 
e{l/nr d ) even with optimum spatial re-use. Thus better-connected disk graphs are penalized by a 
smaller rate of topology formation. We combine the mixing time result ([6]) to examine the scaling 
law for the effective time necessary for convergence in the next section. 

C. Rate of Convergence 

1 ) Slot Mixing Time: We now analyze the asymptotic convergence behavior of the distributed 
averaging algorithm (O in a dense network as n — > 00. From the earlier sections, we know the 
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scaling laws for this regime for: 

1) The number of iterations necessary to a.s. reach an e— ball (from (O). 

2) The shortest TDMA schedule length to a.s. realize in each iteration (from Corollary [TTTl. 
Thus from Definition [Q] for fixed j3, the Slot Mixing Time scales as 



From Proposition Q3] and the Gaussian signaling assumption, when we also allow /3 to depend 
on n such that /3(n) = f2(l), the time to reach this ball scales as 

/ R(n)d/a \ 

^ / d— 9 E \ 



where R(n) = log(l + /3(n)). 

2) Choice of Communication Range: For a fixed /3 the mixing time in (fTOb scales polynomially 
in r for d > 1. Interestingly, for d = 1, the time slots to mix scales as the inverse of r. This 
suggests that increasing r can improve the rate of convergence. For d = 2, however, this quantity 
scales independently of r, suggesting that these two effects exactly cancel each other, a rather 
non-intuitive result. For higher dimensions, the scaling law has a positive exponent in r — implying 
that the increasing r can actually slow down mixing. 

This dependence on network dimension can be understood as follows. If the network is one- 
dimensional, although a transmitter is an isotropic radiator, its effect on the network is seen only 
along the line [0, 1]. Although the throughput provided by the optimal TDMA protocol only scales 
as 6(n _1 r _1 ) for a given f} from Corollary [T7J the spectral gap scales as 0(r~ 2 ), offsetting 
this loss. In d— dimensions, however, while the the fastest rate of topology formation scales as 
@(n~ 1 r~ d ), the spectral gap only scales as G(r~ 2 ). As a result, improving spatial re-use can 
become more important than increasing connectivity. 

3 ) Effect of Increasing Transmission Rate: On the one hand, higher transmission rate reduces 
the packet transmission time; on the other, it also restricts spatial re-use. Clearly the benefit of 
smaller packet transmission times can be outweighed by reduced spatial re-use for large rates R. 



T s]ots (G* h ) = T mix (Wf ) • T*(Gt,P) = ®{nr d - 2 \ogn) 



(10) 



slots a.a.s., for e = 1/n 5 . 




a.a.s. 
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V. Convergence in Networks with Selective Long-Range Connectivity 

A. Scaling of the Spectral Gap 

To derive the scaling law for the mixing time, we need to find the scaling of the spectral gap of 
G n . As we will see, deriving the scaling law for the conductance of G\ is sufficient to establish 
the scaling of the spectral gap. 

Proposition 18. The conductance of G\ with edge weights determined by is G(r 7 ) a.a.s., for 
d = 1,2,... 

Proof: We adopt a modified version of the proof in |34l . From (01) we know that 

, . Q(s,s) 

h = mm ; — — . 

ScSl,w(S)<l/2 TT*(S) 

By the symmetry in G l n induced by the construction in Section IIII-B2L it can be shown using 
arguments similar to |[34l Appendix G] that the minimum occurs for tt*(S) = 1/2, and that the 
minimizing cut (S, S) is a hyperplane dividing the torus into two halves. Without loss of generality, 
define S = $n {[0,1/2) x [0,1]}. 

Also for the natural random walk, each edge weight is J- = (-^) (for d dimensions, 
0(n _1 r~ d )), and the equilibrium distribution is ©(-). It is thus sufficient to count the number 
of edges traversing this cut. The number of short edges was shown in |[34l to be 0(n 2 r 3 ) (for d 
dimensions @(n 2 r d+1 )). Observe that every node in a square of side rjr has Anrfr 2 long-range 
partners. One quarter of these edges traverse the cut (S,S); hence the potential number of long 
edges that can traverse the cut from a given square is nr/ 2 r 2 x nifr 2 = B(n 2 r 4 ). Since each 
edge has length at least s/2 — 2r\r = 0(r 7 ) (since s = 0(r 7 ) and < 7 < 1), which is at 
most s, it is clear that 0(r 7_1 ) squares from the cut will contribute to the edges that traverse the 
cut (see Fig. |4]). Multiplying this result by the number of rows 0(r _1 ) of such squares, the total 
number of long edges traversing the cut will be 0(n 2 r 4 x r 7_1 x r _1 ) = 0(n 2 r 2+7 ) (for general 
d, Q(n 2 r 2d x r 7_1 x r ^ d + 1 ^ = 0(n 2 r d+7 )). Counting both the short and long edges, we have in 
d dimensions, 

(„2„rf+l 1 r .2„d+7\ 
n t r : ) 

= G(r 7 ), 
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Figure 4. The geometry behind the proof of Proposition [T8] for d = 2. The tiling used for the construction of G n 
is overlaid. By the symmetry induced by the construction, the set S C V„ for which Q(S, S)/tt*(S) is minimized 
corresponds to the left-half of the torus as labeled (it can be argued that this set will have the smallest weighted flow for 
a given frequency of steady-state occupancy). Since the stationary distribution for this set is 1 /2, finding the scaling law 
for the number of edges that traverse the cut is sufficient to provide a corresponding scaling result for the conductance. 
For the short-range communication graph G„ (i.e., the disk graph whose edge length is 0(r)) only nodes from a finite 
number of squares from the tiling in either direction from the cut contribute to these edges. For long edges of length 
0(r 7 ), a positive fraction of the nodes from 0(r 7 /r) squares on either side will contribute to these edges. Since there 
are <9(l/r) such rows of squares, the proof lies in finding the scaling law for the number of edges that traverse the cut. 



since 7 < 1. 

Notice that if a node were allowed to have only a finite number of long-range partners, the 
contribution of long-edges towards conductance is smaller, without significant interference-reducing 
benefits. We elaborate on this point in Section IV-C3I ■ 

We can infer the following from the above result: 

Corollary 19. The spectral gap of G\ is S7(r 27 ) and 0(r 7 ). 

Proof: From the lower bound in Theorem [T3l we have 1 — \ii = 17 (r 27 ). From the upper 
bound from the same theorem, we have 1 — ^2 = 0(r 7 ). ■ 
As noted in Section IIII-B2I the distance between any two (graph-theoretic) neighbors is no 
more than s/y/2. Thus every edge in G l n (r,s,d) is also present in the disk graph G^ h (s/2), 
i.e., G\(r, s,d) C G^ h (s/2). Hence a reversible random walk on G l n with a uniform equilibrium 
distribution can mix no faster than the fastest mixing such random walk on G^(s/2). This key 
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observation allows us to use a known result that follows from (21 Thm. 8]: 

Theorem 20. The spectral gap corresponding to the transition probability matrix of the fastest 
mixing reversible random walk on G^ h (r 7 ) with a uniform equilibrium distribution is 0(r 27 ) a.a.s. 

Since mixing time decreases with spectral gap, from Theorem l20l we conclude that the spectral 
gap of G l n is 0(r 27 ). But we know from Corollary [19] that this gap is also J7(r 27 ). Thus we 
conclude that the spectral gap of G l n is 0(r 27 ), which is formally stated as a theorem: 

Theorem 21. The spectral gap of the natural random walk on G l n is 0(r 27 ). 

This result suggests that the improvement in spectral gap from an increased communication radius 
from r to r 7 can also be achieved (in the scaling sense) by allowing each node to communicate 
with a selected number of nodes at a distance 0(r 7 ). 

However, as we shall discuss in the next section, such connectivity comes at a price of a lowered 
rate of topology formation. We find that this loss (as measured by the shortest TDMA schedule 
length) must be no smaller than the number of nodes in the largest exclusion zone created in the 
network. Since the longest link distance in both the disk graph G^ h (s/2) and G l n are of the same 
order, the similarity in the expressions for the spectral gap scaling law suggests that we should 
expect the same dependence on network dimension as in (ITOb . 

B. Convergence with Interference 

We will derive bounds for the shortest feasible TDMA schedule for G x n . In the spirit of the 
earlier proofs, the lower bound follows from the feasibility constraint (i.e., the schedule constructs 
the desired message passing graph while satisfying the SINR constraint), while the upper bound 
is found by bounding the length of the optimum schedule by that of a specific feasible schedule. 
These results are presented in the following. 

Proposition 22. For a given f3, a feasible schedule for G l n has C^nr' ld l3 d ^ a slots a.a.s. for some 
positive constant C3. Furthermore, for a given j3, this length scales as £l{nr" td ) slots a.a.s. 

Proof: We prove this result for d = 2; the proof for d 7^ 2 is similar. From the system model, 
it is clear that a TDMA protocol that constructs G l n must form at least one link of distance at 
least s/2 — 2rjr. Since s = 0(r 7 ) (i.e., s scales "much slower" than r), at large enough n, the 
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protocol must create an exclusion zone of radius of at least s/4 in the network at least once. All 
nodes within this exclusion zone must transmit at least once. But s = td{rj), which implies we 
operate the supercritical regime. From a similar argument as in Proposition [15] we can assert that 
any feasible TDMA protocol must have at least C^ns d f3 d / a slots where C3 is a positive constant. 
The scaling law for this length follows from the scaling of s with r. ■ 

Proposition 23. For a given (3, the length of the shortest feasible schedule for G\ is no more than 
C4 : (ns d j3 d l a ) slots a.a.s., for some positive constant C4, For a given (3, this upper bound scales 
as 0(nr d ) a.a.s. 

Proof: Consider any TDMA protocol that allows each node to communicate with every node 
within a distance s. Clearly this protocol will also construct G\ and is hence feasible. As in 
Proposition [T6l we construct such a four-phase (for d = 2, in general a 2d phase) TDMA protocol 
that operates on a tiling of the torus with squares of side G(s). Using an argument similar to 
Proposition [T6l it is clear that the spatial re-use can be adjusted to construct the graph in C^ns d (3 d ^ a 
slots a.a.s. for some constant C4 > 0. Using s = 0(r 7 ) we get the scaling law. ■ 

Corollary 24. As n — > 00, the shortest feasible schedule for G l n has T*(G l n ,(3) = Q(nr ld ) slots 
a.a.s., for a fixed /3. If we also let (3 = (3(n) = 0(1), T*(G l n , (3{n)) = Q(nrt d {(3(n)}^). 

Proof: Follows from Propositions [22] and [23] ■ 

C. Rate of Convergence with Sparse Long-Range Connectivity 

We repeat the analysis in Section IIV-CI to study the benefit of sparse long-range connectivity for 
a large number of nodes. From this analysis, we derive a result analogous to (fTOl ) for the long-range 
model. We use this result to discuss the impact of increased communication range. 

1) Slot Mixing Time: From Theorem |2T1 the spectral gap of G\ t scales as 0(r 27 ). Conse- 
quently, from the mixing time bounds in Theorem [T21 we conclude that the mixing time with 
scales as 

T mix (W') = e(r 2 nogn) a.a.s. (11) 

iterations for e = \/n s . On the other hand, from Corollary [24] the shortest TDMA schedule that 
realizes G l n scales as 0(nr d7 ) slots. 
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Multiplying T^CW^) and T*(G l n ,(3) we obtain a scaling law analogous to (TTOb for a network 
with sparse long links. We state this result as a proposition: 

Proposition 25. As n — > oo, for e = l/n s and with the shortest feasible TDM A schedule, the slot 
mixing time of natural random walks on a sequence of random graphs (G>J on a d— dimensional 
torus scales as 



where r is the short range communication radius, long links are 0(r 7 ) for some < 7 < 1, and 
nodes use point-to-point capacity-achieving AWGN channel codes with SNR threshold ft. 

From Proposition [15] and the Gaussian signaling assumption in Section IIII-CI when we also let 
/3 = /3(n) = Sl(l), the time to reach this ball scales as 

/ p R(n)d/a \ 



where R(n) = log(l + (3{n)). 

2 ) Impact of Increasing Communication Range on the Convergence Speed: For a fixed f3, from 
(fT2l) we notice that as with short-range links, the slot mixing time scales polynomially in r for 
d > 1. The parameter s that controls the distance of long-range communication enters the scaling 
law through r 7 , since s = 0(r 7 ). By comparing (fT2)) and (TTOb it is clear that its role is identical 
to that of r in (ITOb . Thus we expect the impact of increased communication range to have the 
same dependence of the network dimension as in ( fTOb . From Proposition [22] and an analysis 
similar to Section ITV-C3I it follows that while one-dimensional networks can converge faster from 
an increased communication range despite greater interference, the convergence speed of two- 
dimensional networks scales independently of the communication range. In higher-dimensions 
the increased interference from a larger communication range can actually lower the rate of 
convergence. 

From the model s = G(r 7 ), which implies that a larger s can result from either a larger 
r (communicating with more nearby nodes) or a smaller 7 (communicating with nodes farther 
away). In either case we find that (fT2l scales faster than (fTOl) : when interference is accounted for, 
selective long-range communications do not improve the rate of convergence. 




(12) 




a.a.s. 
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3 ) The Importance of Long-Range Clusters: Here we discuss the importance of forming long 
links from a node to a cluster of nodes. Briefly, we argue that adding only a few long edges to 
a given node does not take full advantage of the broadcast nature of the wireless medium: while 
these fewer long edges to a node reduce the spectral gap (and can increase mixing time as a result 
of Theorem [T2l ). forming long links from a node to a cluster of nodes causes approximately the 
same interference as forming a point-to-point link of the same distance. Hence in lowering this 
cluster size, we do not gain from reduced interference, but can only worsen the spectral gap. So, 
when interference is factored in, allowing a node to talk to a far-off cluster rather than a few far-off 
nodes allows faster mixing for the same level of interference. 

The effect of forming clusters is captured in the long-range model in Section IIII-B2I which adds 
all the nodes from a partner hypercube as long-range partners. This maximizes the number of long 
edges contributed by each hypercube and results in the scaling law in Proposition [18] This is key 
to deriving Theorem |2T1 

Suppose we modify the way long edges are added in this model by constructing a new graph 
G' n (s) by assigning each node only p n = 0{nr d ) long-range partners in each partner hypercube. 
Evidently G' n (s) is regular a.a.s., with node degree nr d \b(0, 1)| + 2dp n + o(l); so iterations as in 
£[]) converge to the average consensus point a.a.s. Denote the corresponding update matrix by W^. 
We will now examine the scaling of the spectral gap of W^. 

Following the steps in the proof of Proposition [T8l the (edge-weighted) conductance of G' is 
0(r + 7° '(p n /nr d )). Since p n = 0{nr d ), the conductance can scale no faster than r 7 . 

Therefore, unlike in the case with G l n (s), exploiting the inclusion G' n (s) C G^ h (s/2) is not 
enough to conclude the spectral gap of to be G(r 27 ). But the inclusion does confirm the 
spectral gap to be 0(r 27 ). Hence, as one would expect, iterations of the form CO) can converge no 
faster with than with W^. 

However, in the scaling limit, the interference resulting from the construction of G l n or G' n are 
the same: it is obvious from Propositions [22] and [23] that the shortest feasible TDMA schedule 
for G' n is also Q(nr ld ) slots. We thus conclude that maximizing the cluster size to include all 
the nodes inside a partner hypercube speeds up convergence for the same level of interference. 
However, when this cluster is enlarged to include all nodes within a radius s, we have a disk graph 
with radius s. From the results in the previous sections, it is clear that the interference penalty to 
realize this larger disk graph scales similarly but is certainly larger than that of G l n (s), which has 
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only selective long-range links. 

VI. Conclusions 

We analyzed the convergence rate of average consensus algorithms in the scaling limit of dense 
wireless networks by combining results from Markov chain theory, random geometric graphs, 
and wireless networks. When messages in a topology are exchanged over wireless links, the 
impact of a greater communication range depends crucially on the network dimension. Increased 
communication range can speed up convergence in one-dimensional networks despite greater 
interference. In two-dimensional networks, the convergence speed scales independently of the 
communication range. In three- (and higher-) dimensional networks, forming long links can actually 
slow down convergence. These results hold whether each node only communicates over short links, 
or, additionally, with a cluster of far-away nodes. 

These results greatly differ from many optimistic results about the benefit of long-range con- 
nectivity obtained by analyzing the consensus problem in an abstract graph-theoretic setting. Our 
results underline the need to accurately account for the cost of interference in designing fast- 
converging topologies for the average consensus algorithm, or for distributed signal processing 
problems, in general. 
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